Uncommon information (the cost of exchanging a quantum state) 
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If two parties share an unknown quantum state, one can ask how much quantum communication 
is needed for party A to send her share to party B. Recently, it was found that the number of 
qubits which should be sent is given by the conditional entropy. This quantifies the notion of partial 
information, and it can even be negative. Here, we not only demand that A send her state to 
B, but additionally, B should send his state to A. Paradoxically, we find that requiring that the 
parties perform this additional task can lower the amount of quantum communication required. This 
primitive, which we call quantum state exchange, can be used to quantify the notion of uncommon 
information, since the two parties only need to send each other the parts of their state they don't 
hold in common. In the classical case, the concept of uncommon information follows trivially from 
the concept of partial information. We find that for quantum states, this is not so. We prove upper 
and lower bounds for the uncommon information and find optimal protocols for several classes of 
states. 



We now understand information in operational terms. 
We quantify it in terms of the amount of communication 
required to convey messages. For classical messages rep- 
resented by a probabilistic source producing messages X 
Shannon showed that a rate of H(X) bits are required 
to convey the message, where H{X) = —J2Px^og 2 p x 
is the Shannon entropy Likewise, for a source pro- 
ducing unknown quantum states with density matrix pA, 
Schumacher [2| showed that S(A) quantum bits (qubits) 
are necessary and sufficient to send the states where 
S(A) = — TvpA log Pa is the von Neumann entropy and 
we drop the explicit dependence on p. We thus see that 
the operational notion of information corresponds to cal- 
culable quantities. 

Now, if the receiver has some prior information about 
the messages to be sent, then generally, less bits (or 
qubits) need to be sent. If we represent the receiver's 
prior information by the variable Y, then the Slepian- 
Wolf theorem tells us that H(X\Y) = H(XY)-H(Y) 
bits will convey the message. This quantity is called the 
conditional entropy, and it gives us a notion of how much 
partial information needs to be sent if the receiver has 
some prior information. The quantum counter-part of 
partial information was recently found by Horodecki and 
ourselves, 0, HJ through considering an analogous sce- 
nario we called quantum state merging. Instead of shar- 
ing a random variable XY, two parties (named Alice and 
Bob), share unknown states from an unknown ensemble 
with density matrix pab- We then allow free classical 
communication, and ask how many qubits Alice needs 
to send so that Bob receives her state. This quantifies 
the partial quantum information, and it was shown to be 
S(A\B) = S{AB) - S{B), the quantum conditional en- 
tropy. Such a quantity was known previously, and it had 
been observed that it can be negative for entangled states 
[H, 0, • State- merging shows that it has a meaning in 
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FIG. 1: A graphical representation of the uncommon informa- 
tion (shaded area) in classical information theory. The total 
information of the source producing pairs of random vari- 
ables X, Y is H(XY), while the information contained in just 
variable X (Y) is H(X) (H(Y)). The information common 
to both variables is the mutual information I(X : Y) (un- 
shaded), while the partial informations are the conditional 
entropies H(X\Y) and H(Y\X). In the quantum case, the 
quantum mutual information I(A : B) can be greater than 
the total information S(AB). To compensate, the partial in- 
formations S(A\B) and S(B\A) can be negative. As we show, 
the uncommon information, defined operationally as state- 
exchange, must be positive in the quantum case. It thus can- 
not be the sum of the two partial informations and can not 
appear on this diagram. 



terms of information. The fact that it can be negative 
then becomes natural - the conditional entropy quanti- 
fies how many qubits need to be sent from Alice to Bob, 
and if it is negative, Alice and Bob gain the potential 
to send future quantum states at no cost. Alice can not 
only send her state to Bob, but the parties are addition- 
ally left with maximally entangled states which can be 
used in the future to teleport quantum states without 
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using a quantum channel. 

Finally, in classical information theory, there is the no- 
tion of mutual information - the amount of correlation 
between two variables. This is given by I(X : Y) = 
H(X)+H(Y) — H(XY), which has the operational mean- 
ing as the rate at which messages can be reliably sent 
through a channel which takes X to Y (after maximizing 
I(X : Y) over inputs) The building blocks of classical 
information can thus be represented by the Venn diagram 
of Figure 1. We will see that in quantum information 
theory, the Venn diagram is completely inadequate for 
representing the basic building blocks of the theory, and 
in fact, even entropies appear inadequate. Already, the 
analogous quantity for the mutual information is slightly 
less clear. One analogous task is the sending of quan- 
tum states through a noisy quantum channel, which can 
be done at a rate equal to a quantity called the coher- 
ent information 1(A) B) = S(B) - S(AB). This quantity 
is asymmetric unlike the quantum mutual information 
I(A : B) = S(A) + S(B) - S(AB) (which can also be 
seen as a measure of total correlations - both classical and 
quantum fioj'). Other quantities which might be seen as 
giving meaning to the notion of shared (common) correla- 
tions in quantum states including the entanglement cost 
E c and the distillable entanglement D (how many max- 
imally entangled states are required to create a shared 
state or are obtainable from it). The last one is a nat- 
ural measure of pure quantum common information as 
classical communication is taken to be free for this task. 

Here, we consider a concept which is complementary 
to the mutual information - we thus call it the uncom- 
mon information (since mutual information is sometimes 
refered to as common information. In the classical case, 
we can quantify it by considering two parties, and ask 
how much classical communication is required for them 
to exchange their messages. I.e. if Alice has X and Bob 
Y , how much communication do they need for Bob to get 
X and Alice to get Y. This naturally and operationally 
defines the notion of uncommon information, since they 
will have to transfer to each other the parts of their mes- 
sage which the other party doesn't know about. It is 
also a common communication primitive - most conver- 
sations involve exchanging information. The solution to 
this problem follows immediately by application of the 
Slepian- Wolf theorem - Alice sends H(X\Y) bits to Bob, 
who then has X, and Bob then sends H(Y\X) bits to 
Alice so that she has Y. This nicely divides the to- 
tal information H(XY) into two parts, the mutual in- 
formation I(X : Y), and the uncommon information 
U(X : Y) = H(X\Y) + H(Y\X). We then see that 
S(XY) = I(X : Y) + U(X : Y). 

We now want to find the appropriate quantum coun- 
terpart to the classical uncommon information. As we 
did with state merging, we will consider an operational 
task which is analogous to the classical task - we call it 
quantum state exchange. Namely, Alice and Bob share 



unknown states which are emitted from a source char- 
acterized by density matrix pab ~ they want to swap 
states, and we ask how many qubits they need to send 
in total, while allowing classical communication for free 
(since we are interested in isolating the quantum part of 
the information), also consider the case Unlike the clas- 
sical case, where the Slepian- Wolf theorem allows one 
to quickly solve message exchange, we will see that one 
cannot use state merging to solve this problem - the sit- 
uation is completely different. Indeed this must be so - 
the quantity S(A\B) + S(B\A) can be negative, and if 
it gave the rate for state exchange, Alice and Bob would 
be apply to continually exchange their states, generat- 
ing an arbitrarily large amount of pure entanglement. In 
essence we will see that, quantum state merging does not 
solve the problem of state exchange due to the no-cloning 
theorem [llj. S(A\B) + S(B\A) thus appears to have no 
physical or operational information-theoretic meaning - 
in sharp contrast to the classical case. 

In the remainder of this article, we will more formally 
define the notion of quantum state exchange, and then 
provide several protocols and solve some examples. We 
then provide an upper bound for the quantum uncommon 
information T(A : B) given by E C (R : A) + E C (R : B) 
where the system R purifies pab, i-e. there is a pure state 
Wabr and Tr fi |^) (iP\abr = Pab- S(AB) is also proven 
to be an upper bound. E c is the amount of pure state en- 
tanglement needed to create a state between two parties 
12l 13l | . We shall then prove a lower bound on T(A : B) 
given by D^(R)A) + D^(R)B) where D^(R)T) is the 
one way distillable entanglement with classical communi- 
cation from RtoT only. Another provable lower bound is 
maxA [S^AV) — S(BV)) where the maximization is over 
channels A : R — >V. Stranglely, S(A\B) + S(B\A), the 
minimum rate for Alice to send her state to Bob, plus 
the minimum rate for Bob to send to Alice is not a lower 
bound, and we give examples of states for which S(A\B) 
[or S(B\A)} can be very large while the rate for state 
exchange is small. 

As with Schumacher compression, we consider the case 
of sources producing unknown states, and we know the 
statistics of the source only through the density matrix 
(extension to the case of unknown sources is also possible 
[14|). but we make no assumptions about the ensemble 
of states which may be emitted by the source - the states 
are unknown. We consider two separated parties, in pos- 
session of n copies of state pab- A faithful protocol is 
one which works with high probability, averaged over all 
possible unknown states in an ensemble. An elegant re- 
formulation of this is to consider a reference system i?, 
and total pure state \i/j)abr, and define success of the 
protocols by demanding that after Alice's state has been 
transferred to Bob's site, the total state \iP)abr should 
be virtually unchanged. More formally: 

Definition 1 A faithful state merging protocol from 
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Alice to Bob is an operation that transforms the state 
*I J ABR state Pabb' B n R such that for large n 
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where i)B'B"R is equal to the original state tpABR if we 
substitute A — > B' and B —* B" , and the state 4>ab 
is arbitrary. The fidelity F(p,a) — Tr(\/ ^/ap^/a), and 
subsystems B,B' ,B" are at Bob's site. Allowing classical 
communication for free, the partial information is the 
rate amount of pure entanglement needed to achieve state 
merging (taking into account the entanglement left behind 
in the form of 4>ab )■ 

Typically, the state 4>ab will be the possible distilled pure 
entanglement which might be gained after the protocol. 
Similarly, 

Definition 2 A faithful state exchange protocol be- 
tween Alice to Bob is an operation that transforms the 
state ipABR state pTbr such that for large n 
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where tpBAR is equal to the original state tpABR if we 
exchange A with B (i.e. perform the swap operation). 
Allowing classical communication for free, the uncom- 
mon information T(A : B) is the minimum rate of 
pure entanglement required to achieve state exchange. 

State exchange for the completely mixed density ma- 
trix is equivalent to the swap operation and it was shown 
that two bits of pure entanglement are necessary and suf- 
ficient to perform swap on two qubits TH, T3|. However, 
we will now see that if some correlations exist between 
the two systems, and we have many copies of the state, 
we can actually do better. We first look at upper bounds 
for T(A : B), based on specific protocols for state ex- 
change: 

Merge-and-send protocol: Alice merges her state 
with Bob at a cost of S(A\B), who then sends his state 
to Alice at a cost of S(B). The total rate R ms is thus 
S(AB). Here we see how the no-cloning theorem enters 
into the situation. Alice is able to merge her state with 
Bob's, taking advantage of the fact that he has some prior 
information (the state pb)- She thus sends her state at 
the lowest possible rate. However, once she has sent her 
state, she is left with nothing (unlike in the classical case 
where she can make a clone of her message), and Bob 
must send at the maximum rate of S{B). Since individ- 
ually, each party's partial information can be negative, 
it is the no-cloning theorem which prevents the rate of 
state exchange from being negative. 
Double-copy protocol: Each party coherently copies 
their state to an ancilla: A — > A' A", B — > B'B" in some 
known basis which gets optimized. I.e. Alice applies the 
operation |a)^|0)A' |0)a" — ► |0)A|a)A' \a)A" for some ba- 
sis a, and similarly Bob performs this operation in some 
basis b of his choice. Then: 



1. Alice merges one of her copies of her state to Bob 
at a cost of S(A'\B) 

2. Bob merges one of his copies with Alice at a cost 
of S(B'\A") 

3. Alice merges her second copy with Bob at a cost of 
S(A"\A'B") 

4. Bob merges his second copy with Alice, costing 
S(B"\B') 

The total cost gives the following achievable rate Rdc for 
this protocol 

R dc = min (S(A'B) + S{AB') - S(A') - S(B')) (3) 

ab 

= min fe Pa S(o%) +J2PbS(or A )) 



= Ef(A : R) + Ef(B : R) 



(4) 



where the minimization is taken over bases a, 6, and Ef 
is the entanglement of formation [l2T ] ; regularisation (op- 
timising over many copies of input state) leads to the 
better upper bound by the sum of the entanglement costs 
[jjjj . The states a\ (a B ) are those that would be induced 
on A (B) after a measurement on B (A) with outcomes b 
(a) and probabilities pb (p a )- The measurement basis is 
the same as the basis \a) and \b) chosen in the initial copy- 
ing step. The second equality of Equation (fj|, just comes 
from the fact that S(A') and S(B') are the same as the 
classical entropy H({p a }) and H({pb}). The third equal- 
ity comes from the fact that a measurement on system 
A which minimizes the entropy of system B (conditioned 
on the outcomes of measurements), can alternatively be 
thought of as a measurement which produces pure states 
\ip a )BR ~ i-e. it is a decomposition of pbr into pure states 
\4> a )BR. with minimal total entanglement. 

Expression ([3]) can also be interpreted as the sum of 
two classical-quantum conditional entropies, each of the 
form 

SH{A\B) = mi{S{AK{B)) - S{K{B))). (5) 

I.e., conditional entropies obtained after applying the de- 
cohering map A on the conditioning system as is done in 
the protocol. 



Note also that we can express this rate [17[ in terms of 
a measurement of classical correlations, the Henderson- 
Vedral quantity [18( which we regularize Cff v 

R dc = S(A) + S(B) - C^ V (A)B) - C% V (B)A). (6) 

Cff v is operationally equal to the one-way distillablc 
common randomness . Given that we have here a very 
simple protocol involving only four rounds, it seems pos- 
sible that a more complicated protocol with many rounds 
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may be related not to CS V but perhaps the classical in- 
formation deficit A c i [1 0j | . 

Modified double copy protocol: One can modify the 
preceding protocol slightly, by not required the two par- 
ties to perform a complete copying operation. Each can 
divide their states into parts which are copied, and parts 
which are merged. We will see that this can be better. 

The double copy protocol is optimal for the so-called 
classical states, i.e. states of the form 

Qi B =J2Pab\a){a\®\b){b\ (7) 

since for these states one can copy in the local eigenbasis 
without changing any of the entropies. One can thus 
achieve a rate of S(A\B) + S(B\A) which is optimal due 
to the lower bound of Theorem [2] proven below. The 
modified double copy protocol appears to be optimal for 
one-sided classical states, for example those of the form 

9%=^^^®^ ( g ) 

with the classical part on Alice's side. In this case, Alice 
copies and merges one of her copies with Bob who then 
merges his entire state with Alice's second copy. Alice 
then merges this second copy, again achieving the rate 
of S(A\B) + S(B\A). That S(B\A) qubits need to be 
sent from Bob follows from Theorem [2] proven below and 
we suspect that S(A\B) bits also needs to be sent from 
Alice. 

We finally mention another protocol: do nothing. 
This is possible (and clearly optimal) for states which are 
supported on only the symmetric (or antisymmetric) sub- 
space of Alice and Bob, or are locally equivalent to such 
states. For symmetric or antisymmetric states on AB, 
\4>)abr = ±|^).bar> and thus nothing needs to be done 
since a global phase does not influence the three-party 
density operator. In particular, all pure states on AB 
have T(^4 : B) = 0, as one would expect: they are fully 
correlated in the Schmidt basis, and thus contain no un- 
common information. Surprisingly, neither S(A\B) nor 
S(B\A) are zero for such (anti-)symmetric states, thus if 
the task is for Alice to send her state to Bob, she needs 
S(A\B) bits of entanglement, while if we demand that 
they perform the additinal task of Bob sending his state 
to Alice, the task becomes easier, and no quantum or 
classical communication needs to be exchanged. In the 
classical case, this of course never happens. 

By exhibiting particular protocols, we thus obtain 

Theorem 1 The uncommon information T(A : B) is 
upper-bounded T(A : B) < S{AB) and T(A : B) < 
E C (A:R) + E C (B:R). 

We now turn to lower bounds, and will prove 



Theorem 2 The uncommon information T(A : B) sat- 
isfies the following lower bounds: 

T(A : B) > D^{R)A) + D^(R)B) and 
T(A : B) > max \S(BV) - S(AV)] , 

A 

where the maximization is over channels A : R — > V . 

The proof of Theorem [2] is straightforward - for the 
first inequality, we imagine R as a referee who will check 
to see whether the output state p e A x BR is close in fidelity 
to \iP)abb,- Before Alice and Bob begin the protocol, the 
referee performs one way distillation with Alice or Bob 
by performing local operations on her state. To distill 
maximally entangled states she would normally commu- 
nicate with the other party, but this isn't necessary - it is 
only used to tell Alice or Bob which parts of their state 
contain the distilled entanglement. From the referee's 
perspective she holds D^(R)A) (or D^(R)B)) bits of 
pure entanglement with A (or B). Imagine she distills 
entanglement (i.e. state with Alice. Then clearly 
on RA must be transferred by Alice to Bob, since the 
referee can check after completion of the protocol by ask- 
ing Bob for the appropriate bits, and checking the fidelity 
of the subsystem of Pa x br which contains tp + . However, 
Alice and Bob perform their protocol before they know 
which party the referee distilled with, and thus from their 
point of view, pure state entanglement with R may ex- 
ist on both their states which needs to be transferred 
to the other party. Thus just as Alice needs to transfer 
D^(R)A) to Bob, Bob also needs to transfer D^(R)B) 
qubits to Alice in case the referee distilled entanglement 
with him. 

The second inequality comes by imagining dividing R 
into two parts, E and V (which is equivalent to a chan- 
nel with E treated as the channel's environment), and 
giving pv to Alice and pe to Bob. Before the protocol, 
the entanglement is S(AV) and after the state exchange 
the entanglement is S(BV). Since entanglement can- 
not increase more than the number of qubits exchanged, 
the difference between final and initial entanglement is a 
lower bound on the number of sent qubits. Optimizing 
over splittings of pr gives the required bound. □ 

Let us turn from uncommon information to the no- 
tion of common information (which can be taken as 
a more general notion than mutual information [||). 
State exchange considerations suggest that it be given 
by C(A : B) = S{AB) -T(A : B), i.e. the uncommon 
information subtracted from the total information. Such 
a quantity is always positive by Theorem [TJ but is very 
different from the mutual information (for example, it is 
zero for pure states) . Part of the reason for this is that 
mutual information measures the correlations between 
the two parties, while the common information quanti- 
fies how much information about a reference system the 
two parties share in common. It is thus zero for pure 
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states because a pure state has no information about the 
reference system, while it is maximal for the symmetric 
states, where all the information is common. It would be 
interesting to explore this notion of common information 
further, especially compared with other notions such as 
the mutual information, coherent information, distillablc 
entanglement and entanglement of formation. 
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